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■^ ' Abstract 

^— N ' Let W{n) be either the number of descents or inversions of a permutation tt G 5„. Stein's 

rvq , method is applied to show that W satisfies a central limit theorem with error rate n^^". The 

wp.' construction of an exchangeable pair {W, W) used in Stein's method is non-trivial and uses a 

I— { , non-reversible Markov chain. 

<■ 

[^ ■ 1 Introduction 

T-H ■ 

,— ,■ We begin by recalling two permutation statistics on the symmetric group Sn which are of interest to 

P^ , combinatorialists and statisticians. A good introduction to the combinatorial aspects of permuta- 

Ph I tion statistics is Chapter 1 of Stanley [Sta] , and a superb account of their applications to statistical 

,J^ ' problems is Chapter 6 of Diaconis [Di] . 

C^ . The first statistic on Sn is Des{TT), the number of descents of vr. This is defined as the number 

H I of pairs (i, i + 1) with 1 < i < n — 1 such that 7r(i) > 7r(i -|- 1). Writing tt in two-line form, this is the 

'""'■ number of times the value of the permutation vr decreases. (A more general definition of descents 

^\j . exists for Coxeter groups: the number of height one positive roots sent to negative roots by tt). 

K^ I The number of permutations vr in 5„ with k + 1 descents is also called the Eulerian number A{n, k) 

..jj. ' and has been studied extensively [DiP], [FSc], [K]. Several proofs are known for the asymptotic 

(S| ' {n -^ c>o) normality of A(n,k). See for instance Diaconis and Pitman [DIP], Pitman [P], Bender 

CN , [Be], and Tanny [T]. A proof using the method of moments should also work. 

1^^ I A second well-studied statistic on 5„ is /nw(vr), the number of inversions of vr. In the statistics 

Q\ • community this is called Kendall's tau. Inv is defined as the number of pairs {i,j) with 1 < 

fH . i < j < n such that vr(i) > vr(j). Writing vr in two-line form, this is the number of pairs (i,j) 

whose values are out of order, /(vr) is also the length of vr in terms of the standard generators 

{{i,i + l) : 1 < i < n — 1} for Sn- (For an arbitrary Coxeter group, Inv^n) is the number of positive 

• • . roots sent to negative roots by vr). Proofs of the asymptotic normality of /nw(vr) for Sn can be 

. ^ I found in Bender [Be] and Chapter 6 of Diaconis [Di] . 

p\ ' The following definition generalizes both of these statistics. Let M = (Mij) be a real, anti- 

j^ ■ symmetric, n *n matrix. Let X be the random variable on Sn defined by X(vr) = J2i<j ^n{i),n{j)- 

Setting Mi J = — lifj=i-|-l, Mij = 1 if j = i — 1, and Mij = otherwise leads to ^(vr) = 

2£)es(vr-^) - (n - 1). Setting Mij = -1 if i < j, Mij = +1 if i > j, and Mi^i = leads to 

X(it) = 2Inv(7r^^) — (!!). Define W = , , so that W has mean and variance 1. 

Charles Stein developed a method for bounding the sup norm between the distribution of a 
random variable and the standard normal distribution. His technique has come to be known as 
Stein's method. Stein's book [Ste] and the papers in this volume are good references. 

Let us recall some notation from probability theory. If Y, Z are random variables on a probability 
space (i7, B, P), we let E{Y) denote the expected value of Y and E^{Y) the expected value of Y 
given Z, where both expectations are taken under P. In the case at hand, 0, is Sn, B is all subsets 
of Sn, and P is the uniform distribution. Call W, W an exchangeable pair of random variables on 

Sn if P{W = Wi,W' = U;2) = P{W = W2, W = Wi). 

Theorem 1 is due to Rinott and Rotar. 



C^ 



Theorem 1 ([RR]) Let W, W he an exchangeable pair of real random variables such that E^W' = 
(1 — \)W with < A < 1. Suppose moreover that \W' — W\ < A for some constant A. Then for 
all real x, 

\P{W < x] - ^{x)\ < ^JVar{E^{W' - W)^) + 48— + 8^ 

A * A vA 

where <& is the standard normal distribution. 

Theorem 1 will be used to prove Theorem 2. 

Theorem 2 Let Des{n) and Inv{7r) be the number of descents and inversions of tt £ Sn- Then 
for all real x, 

\P{ r-r' <x}-Hx)\<\ 

\p{ /;\^^/^ <x}-.i>(x)i<4 

' ?i(ra— l)(2?i+5) j^2 



72 

where C is a constant independent of n. 

We remark that Theorem 2 is known by other proof techniques (see [DiP] for the case of 
descents and [Bic] for inversions). We recently learned that there is some overlap with results in 
[BCLZ], which gives bounds for permutation statistics using reversible Markov chains together with 
Bolthausen's variation of Stein's method. 

Section 2 shows how, for W = Des or W = Inv, to construct an exchangeable pair (W, W) 
such that E W' = (1 — -)W. This step, which is usually the easy part of applying Stein's method, 
is non-trivial and uses a non-reversible Markov chain equivalent to the "move to front" chain. The 
only other example in the literature in which exchangeability was not obvious is the paper of Rinott 
and Rotar [RR]. A connection with this work will be mentioned in Section 2. Section 3 develops 
bounds for the terms on the right-hand side of Theorem 1, and indicates why a somewhat weaker 
version of Theorem 1 due to Stein can only give n~^'^ rates. 

We remark that the move to front rule on the symmetric group is a very special case of a theory of 
random walk on the chambers of real hyper plane arrangements [BiHaR] . The corresponding Markov 
chains are non-reversible and have real eigenvalues. These nonreversible chains have recently been 
related to a reversible Markov chain on the set of irreducible representations of the symmetric group 
[F1],[F2]. 

2 Construction of an Exchangeable Pair (W, W) 

This section constructs W so that (W, W) is an exchangeable pair with nice properties. In most 
applications of Stein's method (e.g. the examples in Stein [Ste]), it is clear how to define W and 
exchangeability comes for free. The situation here is more subtle. 

This being said, define W = W'{ti) as follows. Pick / uniformly at random between 1 and n and 
define vr' as (/, / + 1, • • • , n)7r, where (/, / + 1, • • • , n) cycles by mapping /^/ + l^---^n^/, 
and where permutation multiplication is from left to right. For example, suppose that n = 7 and 
/ = 3. Then the permutation n which in 2-line form is: 



is transformed to: 



i : 1 2 3 4 5 6 7 
7r(i) : 6 4 1 5 3 2 7 



12 3 4 5 6 7 
6 4 5 3 2 7 1 



In other words, one moves the number in position / in the second row of vr to the end of this 
second row. Now define W'{tt) = W{tt'). Before discussing exchangeabihty, we prove Lemma 1, 
which was the motivation for the definition of W' and shows that one can take A = - in Theorem 
1. 

Lemma 1 E^W = (1 - ^)W 

Proof: Letting i be the value of the random variable /, ones sees from the definition of W' that: 

1 



1 1 



yTM^^^,t>. 



2 



E -2M 



7r(i),7r{jr') 



-w. 

n 
Since E'^iW' — W) depends on vr only through W , the lemma follows. □ 

Lemma 2 establishes a condition on (Mij) under which the pair (W, W') is exchangeable. This 
condition admittedly has limited scope, but as will be seen, holds for the cases of descents and 
inversions. 

Lemma 2 Given a subset S of {!,• • • ,n}, for each i £ S define Oi^s = J2jeS:j>i^i,j '^"-^ ^^.•S' — 
J2jeS:j<i ^j,i- Suppose that for all subsets S o/ {1, • • • , n} , there is a bijection Q : S ^^ S satisfing 
the following conditions: 

1. For each i £ S, Ui^s - bi,s = be(^i)^s - "'e{i),S- 

2. For each i £ S, there is a bijection ^i : S — {i} ^^ S — {@{i)} such that Mj^k = ^<J>i{i),<J>i{fc) 
for all j,k £ S — {i}. 

Then {W, W) is an exchangeable pair of random variables. 

Proof: It will be shown that P{W = a,W' = b} = P{W = b,W' = a}. For this we prove the 
stronger claim that ii T = {tt £ Sn '■ 7r(j) = Zj for 1 < j < / — 1}, then 

P{W = a,W' = b\I, TT£T} = P{W = b,W' = a, \I, tt £ T} 

In other words, assume that the value of / and the images of {1, •••,/— 1} under tt are given. 

Let S = {'7r(/), • • • , vr(n)} be as in the hypotheses of the lemma. Now define a bijection A : T i-^ T 
as follows: 



1. A(7r)(i)=^(j)for l<j</-l 

2. A{n){I) = e{7r{I)) 

3. A(7r)(i) = <!>.(/) (^(j)) for / + 1 < i < iV 

We only show that W{tt) = M^(A(7r)'), the argument that W{tt') = W{A{7:)) being similar. 
Since vr and A(7r)' agree on 1, •••,/— 1, it is enough to show that 

Z2 ^■^iI),-^U) + Z2 ^7T{i),n{j) = 2J ^A(7r)'(i),A(7r)'(j) + 2_/ ^A W(i),A(7r)'(n) • 
I<j<n I<i<j<n I<i<j<n I<i<n 

Now observe that 

I<i<j<n I<i<j<n 

H M$^(,)(^(i)),1'.(/)(^(i)) 
I<i<j<n 

I<i<j<n 

The second equality is from the definition of A(7r) and the third equality is from condition 2 in the 
lemma. Also observe that 

J2 ^A(7r)'(i),A(7r)'(n) = J2 ^A(7r)(i),A(7r)(7) 

I<i<n ^<i<™ 

= J2 ^A(^)(j),0(^(/)) 
I<j<n 

= ^0(7r(/)),5 - ae(7r{7)),s 
= o-^{i),s - K{i),s 

I<j<n 

The third equality holds because {A(7r)(j) : I < j < n} = S — Q{tt{I)). The fourth equality is from 
condition 1 in the lemma. □ 

Remarks 

1. Let us illustrate the proof of Lemma 2 by example for X{7:) = 2Des{7:^^) — (n — 1). Recall 
that here Mij = —1 ifj = i + 1, Mij = 1 if j = i — 1, and Mjj = otherwise. Suppose 
that / = 3 and 7r(l) = 6,7r(2) = 4. Thus T = {vr G 5„ : 7r(l) = 6,7r(2) = 4}. Note that 
S = {1,2,3,5,7}, because these are the images of 7r(j) for j > I = 3. One observes that 
the bijection Q : S ^ S defined by G(l) = 3, 0(2) = 2, 6(3) = 1, 0(5) = 5, 6(7) = 7 
satisfies condition 1 of Lemma 2 (in general, one defines by reversing within each group of 
consecutive numbers in S). For each i G 5 it is also necessary to define bijections $i such 
that condition 2 of Lemma 2 holds. This can be done by pairing the elements of S" — {i} and 
S — {@{i)} so as to preserve their relative order. For instance, <l>i : {2,3,5,7} ^^ {1,2,5,7} 
is defined by $i(2) = 1, $i(3) = 2, $i(5) = 5, $i(7) = 7. 

These choices determine the bijection A : T ^ T constructed in Lemma 2. For example, 
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: 1 
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5 
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tt{i) 
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2 
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K'il) 


: 6 


4 


5 


3 


2 


7 


1 



i 12 3 4 5 6 7 

A(7r)(i) 6 4 3 5 2 17 



i 12 3 4 5 6 7 

A(7r')(i) 6 4 5 2 17 3 

One checks that ^(vr) = X(A(7r')) = and X{tt') = X{A{tt)) = 2. 

2. Let us illustrate the proof of Lemma 2 by example for X(7r) = 2/nf(7r~^) — (2). Here 
Mi J = — 1 if i < J, Mjj- = +1 if i > j, and Mi^j = 0. As for the case of descents, suppose that 
/ = 3 and 7r(l) = 6, 7r(2) = 4. Then T = {tt G S'n : vr(l) = 6, 7r(2) = 4} and S = {1, 2, 3, 5, 7}. 
The bijection Q : S ^^ S must be defined differently from the descent case so that condition 
1 of Lemma 2 holds. It is easy to see that reversing the elements of S works. Thus G(l) = 7, 
6(2) = 5, 0(3) = 3, 0(5) = 2, and 6(7) = 1. Defining the maps <l>j as in the descent case, 
condition 2 of Lemma 2 holds. 

These choices determine the bijection A : T ^ T constructed in Lemma 2. For example, 

i 12 3 4 5 6 7 

A(7r)(i) 6 4 7 3 2 15 
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K'{i) 


: 6 


4 


5 


3 


2 


7 


1 



i 12 3 4 5 6 7 

A(7r')(i) 6 4 3 2 15 7 

One checks that X{t:) = X{k{-K')) = 1 and X{-k') = X{K{-k)) = 9. 

3. The above examples show that the pair (W, W') is exchangeable for descents and inversions. 
An interesting problem is to classify the matrices {Mij) such that the pair (PF^, W^') is ex- 
changeable. It would also be useful to construct exchangeable pairs (W, W') for other Coxeter 
groups. 

4. Lemma 1.1 of [RR] states the following. Suppose that {T*} is a stationary, nonnegative, 
integer valued process satisfying T*^^ — T* = +1, or —1. Then (T*, T''^^) is an exchangeable 
pair. 

For the case of descents, this gives an alternate proof that W, W as we have defined them are 
an exchangeable pair, even though the underlying chain on permutations is not reversible. 
To see this, let R^ be a uniformly distributed element of 5„; then given i?*, move to i?*+^ 
according to the move random to end rule defined in the beginning of this section. This 
process is stationary. Defining T* to be the number of descents of i?*, one sees that the 
conditions of the lemma hold. 

It is interesting to note that Lemma 1.1 of [RR] was applied there to study W equal to the 
number of ones in a random pick from the stationary distribution of the antivoter model. The 
antivoter chain is not reversible, but their lemma implies that if W is the number of ones 
after a step from the antivoter chain, then (W, W') is an exchangeable pair. 



3 Bounding the Error Terms 

This section bounds the error terms on the right hand side of Theorem 1. 

We start by computing the mean and variance of X and estabUshing a nice property of the pair 
{W, W). For this it is helpful to define Ai = J2j>i Mij and Bi = J2h<i ^Ki- 



Lemma 3 E{X) = and Var{X) 



E,<,w..)^+ELi(-4^-^»)^ 



Proof: Observe that the random variable X on Sn can be written as a sum of random variables 
Xi^j on Sn- Defining a random variable Xij on Sn by 









one has that: 








X(7r): 


i<j 7r-l(j)<7r-l(j) i<j 



The mean of X is since each Xij has mean and expectation is linear. 

The variance of X is equal to E[{J2i<j -^ij(^))^]- The terms E{XfA contribute (Mij)"^ each and 
thus J2i<j{^i,j)'^ i'^ total. The terms E{2XijXk^i) vanish if i,j, k, I are distinct, by independence. 
Now consider what happens when two of these four indices are equal. Terms of the form 2E(XijXi^i) 
contribute |MjjMj^; each. The sum of all such terms can be rewritten as |Ej^f ~ J2i<j{^i,j)'^]- 
Similarly, terms of the form 2E{Xi^iXki) contribute ^[J^i^i ~ J2i<j{Mi,j)'^]- Finally, terms of 
the form 2E{XijXj^k) contribute — |MjjMj^fc each, and hence a total of —"^^^AiBi. The lemma 
follows. □ 

As a consequence of Lemma 3, one recovers the known facts that for a random permutation on 
n symbols, Var{Des{iT)) = '^^^j^r ^^^ Var{Inv{iT)) = "-^"-~ iv " — l^ Note that Lemma 3 has written 
Var{X) as a sum of positive quantities. 

Lemma 4 E{W' - Wf = ^ 

Proof: 

E{W' -Wf = EiE'^i.W'-Wf) 



E \E^{{W'f + W^ - 2WW'] 
E ({W'f + E{W^) - 2WE^{W'\ 



2Var{W) - E{2WE^{W')) 
^-Var{W) 



n 
4 

n 



The fourth equality used the fact that W and W' have the same distribution. The fifth equality 
used Lemma 1. □ 

Lemma 5 establishes a well known inequality. For completeness, we include a proof. 



Lemma 5 ElE"^ {W - Wff < E[E-^{W' - Wf]'^. 

Proof: Jensen's inequality says that if gr is a convex function, and Z a random variable, then 
g{E{Z)) < E{g{Z)). There is also a conditional version of Jensen's inequality (Section 4.1 of 
Durrett [Du]) which says that if F is any a subalgebra of B, then 

E{g{E{Z\F))) < E{g{Z)). 

The lemma follows by applying this inequality to the case g{t) = t^, Z = E'"{W' — W)"^, B is all 
subsets of Sn, and F is the a subalgebra of B generated by the level sets of M^. □ 

Now we prove Theorem 2. 
Proof: (of Theorem 2) We will apply Theorem 1. Note that the move random to end rule 
changes the number of descents by at most one. Hence the corresponding pair {W, W) satisfies 
\W' — W\ < , Similarly the move random to end rule changes the number of inversions 

by at most n-1. Hence the corresponding pair (VF, M^') satisfies \W' — W\ < . . Thus in 

both cases \W' — W\ is at most An'^'"^ for an absolute constant A. Also note by Lemma 1 that 
E^{W') = (1 - X)W with A = f . 

Thus by Theorem 1 the result will follow if it can be shown that Var{E^ {W' — W)^) < 
^. Lemma 5 implies that Var{E^{W' - Wf) < Var{E''{W' - Wf). Hence we show that 
Var{E''{W' - Wf) < ^ 






Observe that 



1 4 

Var{X) n ^ 



E-{w'-wf = T7-7T^-E(E-^^w,^o-): 



1 4 ^ " 



Since X^ILi J2j>ii^n(i),n{j))'^ is independent of vr, it follows that 



^ ' \ 1=1 j>i i=l i<ji<J2<n 



VariE^W'-Wf) = yJ^.,J E Far(M.(,),.(,,)M.(,),.(,,)) 

^ ' l<i<ji<J2<n 

+ E C'of(M^(j)^„(j^)M^(j)_^(j2),M^(fc)^„(;^)M^(jfc)_^(i2))] 

!<Jl<J2.'=<'l<'2 

Let us analyze this bound for the case of descents (i.e. Mij = —1 ifj = i + 1, Mij = 1 if 
j = i — 1, and Mij = otherwise). We first study the summands and then divide by Var[X)'^n'^. 
The first summand has O(n^) terms, each contributing 0{n~'^); hence it is 0{n). The covariance 
terms are also 0{n). To see this, first note that the covariance vanishes if {i,ji,J2} H {k, li, I2} = 0, 
so such terms can be ignored. Suppose that i ^ k. Then there are Oijt') terms each contributing 
0{n~*). li i = k there are subcases to consider based on which (if any) of elements of {ji, J2} are 
equal to elements of {li,l2}- It is straightforward to see that in all cases the contribution of the 
covariance term is 0{n). Since Var{X) is ^^, it follows as desired that Var{E'^{W' — W)"^) < -3-. 

The case of inversions is similar. The variance terms contribute at most 0{n'^) and the covariance 
terms at most order 0{n^). Thus 



where Bq,B are universal constants. □ 

To conclude the paper, we comment on the following result of Stein [Ste]. 

Theorem 3 (Stein) Let W, W' be an exchangeable pair of real random variables such that E W' 
(1 — \)W with < A < 1. Then for all real x, 



\P{W <x}- ^{x)\ < 2Je[1 - -^E^iW - VF)2]2 + {2tt)-^JjE\W' - W\^ 
where <1> is the standard normal distribution. 
Applied to our exchangeable pair this would only yield bounds of order n~^'^, since by Jensen's 

/ \ 3/2 

inequality E\W' - W\^ > {E{W' - W)^)^/^ - ' ^ 
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